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ABSTRACT 

How important are friendships in determining success by in- 
dividuals and teams in complex collaborative environments? 
By combining a novel data set containing the dynamics of 
millions of ad hoc teams from the popular multiplayer online 
first person shooter Halo: Reach with survey data on player 
demographics, play style, psychometrics and friendships de- 
rived from an anonymous online survey, we investigate the 
impact of friendship on collaborative and competitive perfor- 
mance. In addition to finding significant differences in player 
behavior across these variables, we find that friendships ex- 
ert a strong influence, leading to both improved individual 
and team performance — even after controlling for the over- 
all expertise of the team — and increased pro-social behaviors. 
Players also structure their in-game activities around social 
opportunities, and as a result hidden friendship ties can be 
accurately inferred directly from behavioral time series. Vir- 
tual environments that enable such friendship effects will thus 
likely see improved collaboration and competition. 
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INTRODUCTION 

Given the ubiquity of friendships in social interactions and 
complex social systems, the value of any particular friend- 
ship can be difficult to quantify. But some things are known: 
friendships are useful for finding new jobs, as in the "weak 
ties" or Granovetter effect [16]; they are useful for marketing 
because personal attributes — including fixed variables like 
age, ethnicity and location, as well as more fluid variables 
like preferences and opinions — exhibit homophily [27, 36] 
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and thus correlate across friendship ties; and, they are use- 
ful for social filtering or search [4,6], i.e., your friends are 
good at predicting what information you will like. 

But, how valuable are friendships within complex collabora- 
tive environments, particularly those found online? On the 
one hand, teams composed of friends may perform better as 
a result of their extensive collaboration history, i.e., friend- 
ships may increase awareness and understanding of others' 
goals and motivations [33] or they may increase commitment 
to group objectives [22], thereby facilitating more success- 
ful collaboration. Friendships may also reduce within-group 
conflict [21], yielding similar results. On the other hand, 
friendships could detract from performance because friends 
spend less time focusing on group objectives and more time 
socializing [35]. 

Friendship may even be irrelevant for effective collaboration. 
If a competition's outcome depends mainly on team coor- 
dination, teams composed of experts may coordinate effec- 
tively regardless of within-team friendships: highly practiced 
individuals may simply know from experience how to work 
well with other experts and thus naturally anticipate or adapt 
to their teammates' actions (e.g., by fitting into established 
and effective team roles). In such a case, friendships do not 
matter and teams composed of skilled strangers will perform 
best [35]. Thus, the extent to which expertise and friendship 
matter to success is an open question in complex collabora- 
tive environments, particularly those found online. 

Here, we focus on the topic of friendships, collaboration and 
competition in online game environments, and specifically 
in the first-person shooter (FPS) genre, in which teams of 
players compete against each other in non-persistent virtual 
worlds. We are also interested in the variety of social dynam- 
ics naturally observed in this complex environment, and the 
extent to which they vary with demographic and psychome- 
tric variables. There are many studies in CSCW on the de- 
sign of, and human action within, multi-user dungeons [29], 
massively multiplayer online games (MMOGs) [5] (among 
others), social virtual worlds [7], and non-persistent game 
sites [26]. Work on friendships in online gaming has focused 
on how friendships and social relationships are a motivation 
to play games [8, 37] or how playing games builds offline re- 
lationships and social capital of the players [1, 19]. 

Work on guilds and parties [39] in massively multiplayer on- 
line role-playing games (MMORPGs) suggests that teams are 
important for individual satisfaction and performance, and 
anecdotal evidence suggests they are conducive to the for- 
mation of friendships. However, the impact of friendships 



themselves on team or individual performance has not been 
studied quantitatively (but see [19] for some qualitative in- 
sights). And, work on first-person shooter type games has 
generally focused on qualitative analysis of user actions and 
in-game communication [23,32,42]. 

Most work on the utility of friendships in teams is found in the 
management literature. Several such studies argue that friend- 
ships play a significant role in mitigating certain types of 
within-group conflicts; thus, teams of friends perform better 
because internal conflict handicaps performance [21, 22, 35]. 
Others have focused on designing effective teams in busi- 
ness or educational environments [2, 3, 34]. In these settings, 
friendships seem to improve group cohesion, satisfaction, and 
some measures of performance. Thus, we expect friendships 
to play a significant role in the collaboration effectiveness in 
virtual environments, including those found in the team-based 
competitive environments of the FPS genre. 

Finally, the increasing CSCW interest in "serious games" [28] 
as a model for creating effective work spaces makes the study 
of real online games, and particularly those that allow col- 
laboration with both friends and strangers, a valuable labora- 
tory by which to understand the dynamics and determinants 
of collaboration and sociality in virtual environments. 

Here, we study these questions in the context of the popu- 
lar online FPS game Halo: Reach. We combine two large 
and novel data sets: one composed of rich online behav- 
ioral samples from the events within Reach competitions and 
one composed of rich demographic, play style, psychomet- 
ric and friendship variables collected from players of Reach 
through an anonymous online survey. This combination of 
in-game behavioral data with survey data on the same partic- 
ipants has been used productively to study human behavior 
in MMORPGs [40,44] and online parlor games [26], but this 
study is the first time it has been applied to the FPS genre or 
to the question of friendship, collaboration and competition. 

The survey asked participants to label their online and offline 
friendships with other Reach players, and these data allow 
us to test the hypothesis that friendships have a direct and 
significant impact on the results of these competitions. Fea- 
tures within the game allow us to control for the impact of ex- 
pertise. An in-game "matchmaking" system draws groups of 
friends and individual players from the general population to 
create competitions; thus, players collaborate with and com- 
pete against a wide variety of teammates and competitors. Fi- 
nally, we note that the sheer size of Reach, which is played by 
millions of individuals, provides a wealth of data by which to 
investigate these and other questions about collaboration and 
competition. 

We find that friendships significantly improve both individual 
and team performance in these complex collaborative envi- 
ronments. Friendships are not as important to success as the 
raw expertise of teammates (friendship does not compensate 
for being a bad player), but individuals themselves perform 
better the more friends they play with, independent of their 
team's performance. Our survey shows that Reach players 
are also highly social, tending to prefer team-oriented play 



and experiencing a strong sense of group cohesiveness and 
coordination. Furthermore, older players (24 or older; 30% of 
players) tend to be more socially oriented and exhibit greater 
proclivity for pro-social behaviors. Above and beyond per- 
formance, friendships also reshape the style of play within 
a competition, and players structure their choices around op- 
portunities to play repeatedly with friends. As a result, friend- 
ships can be accurately inferred directly from behavioral time 
seres data. These results illustrate the strong role that friend- 
ships play in complex collaborative and competitive virtual 
environments. They also shed light on social dynamics within 
FPS games and suggest algorithms designed to account for 
the benefits of friendship in predicting or designing competi- 
tive environments. 

METHODOLOGY 

Game Mechanics and Data 

Halo: Reach is a multiplayer online first person shooter game 
played by more than ten million people worldwide. It was 
publicly released by Bungie Inc., a former subdivision of Mi- 
crosoft Game Studios, on 14 September 2010 and Reach play- 
ers have now generated more than 1 billion games. Players 
choose from among seven game types and numerous sub- 
types, which are played on more than 33 terrain maps with 
74 weapons. Games can be played alone, with or against 
other players via the Xbox Live online system. Both indi- 
vidual game and player summaries are available through the 
Halo Reach Stats API. Through this interface, we collected 
the player details of all individuals who participated in our 
online survey and each of their full game histories, which 
yielded 2,445,617 complete games. 

Among other information, each game file includes the se- 
quence of the scoring events at the per-second resolution and 
a list of players by team. Scoring events are annotated with 
the name of the player generating the event (a unique Xbox 
Live gamertag), the number of points scored and the player 
giving up the points (if applicable). 

Unlike professional sports [15], teams in Reach do not per- 
sist across competitions. Instead, each time a competition is 
created, individuals or small "parties" of players (typically 
friends) are grouped into teams by an in-game "matchmak- 
ing" algorithm; when the competition is complete, the play- 
ers (or a subset) may choose to play together again as a party, 
or may reenter the matchmaking process to find new team- 
mates or competitors. The result is that players routinely col- 
laborate with or compete against different strangers in suc- 
cessive games. Reach's matchmaking system uses an algo- 
rithm called TrueSkill [18] to ensure equal total "skill" for 
both teams, which allows us to control for this variable in our 
subsequent analysis. 

Among the sampled games, there are three basic types: cam- 
paign games, a sequence of story-driven, non-competitive, 
player-versus-environment (PvE) maps that many players 
complete prior to trying other types of games ;firefight games 
(also PvE), in which a team of human-controlled players bat- 
tle successive waves of computer-controlled enemies; and 
competitive games, a player-versus-player (PvP) game type, 
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Figure 1, Age distribution of the survey respondents. 

in which teams of equal size (typically 2, 4, 6 or 8 players) 
compete to either be the first to reach some fixed number of 
points or have the largest score after a fixed length of time. 
(The precise number of players per team, points required to 
win and length of a game depends on the game subtype.) In 
competitive games, players' avatars move through the game 
map simultaneously, in real-time, navigating complex terrain, 
acquiring avatar modifications and encountering opponents. 
Points are scored by dealing sufficient damage to eliminate 
an opposing avatar and for each such success, a team gains a 
single point. Eliminated players must then wait several sec- 
onds before their avatar is placed back into the game at one 
of several specified "spawn" locations. 

Survey Data 

In addition to the in-game behavioral data available through 
the API, we collected data on 1191 unique Reach players via 
an anonymous online survey. 1 The survey design incorpo- 
rated extensive feedback from expert Reach players, includ- 
ing ourselves, to better frame questions in appropriate lan- 
guage and to focus on relevant social dynamics. 

We advertised the survey through Halo-centric online forums 
and other community websites. While this did result in a bi- 
ased sample (see below), the bias was towards participants 
who were active and therefore had more in-game data to draw 
upon. Participants supplied their Xbox Live gamertag, which 
we used to download their Reach player details and their en- 
tire Reach game history up to the date of survey participa- 
tion. Within the survey, an initial screening was conducted 
based on participant age: participants who indicated their age 
was under 18 were routed through an additional email-based 
parental consent step. 



The player detail files contain only summary statistics about 
Reach game play and do not contain demographic or social 
network information. The survey solicited these missing de- 
mographic variables (age, sex, location, highest education 
level, primary language), along with preferred play style (lone 
wolf, team leader, or team support), and psychometric re- 
sponses like measures of group cohesion [38], entativity [25], 
and conflict [31]. In answering these questions, respondents 
were instructed to consider their "primary Halo group," de- 
fined as the group of friends with whom they primarily play 
Reach at the time of the survey. Finally, respondents indi- 
cated their relationship (online friend, offline friend or not a 
friend) with every other unique gametag that appeared in their 
game history. Because this list could include hundreds or 
even thousands of gamertags, it was sorted by the frequency 
of co-plays, which placed the most likely true positives at the 
top. The interpretation of the meaning of an online and offline 
friend was left to the participant, although we suggested on- 
line friends were people "you play with regularly and would 
say you know at least casually." This is conceptually distinct 
from any feature available on Halo or XBox for indicating 
friendships. 

PLAYER ATTRIBUTES 

Demographics 

As is typical with many online games [41] and especially first 
person shooters [20], our survey respondents were mostly 
young men. The median age of our participants was 20, 
which is considerably lower than the mean age of 37 for video 
game players overall [14]. Of the 1191 respondents, an over- 
whelming majority (94.9%) reported their sex as male (com- 
pared with 80.8% male in MMO players [41] and 58% re- 
ported by the industry overall [14]). The reported age distri- 
bution of the respondents is relatively smooth, with the excep- 
tion of a large spike at 18 (see Figure 1), presumably caused 
by individuals younger than 18 misreporting their age so as 
to avoid the parental consent requirement. 2 

This contrasts with the reported age distribution of MMO 
games, in which the plurality of players are in their 30s rather 
than their 20s [41]. However, the age distribution also ex- 
hibits a long right tail, with 12.8% being at least 30 years 
old. In agreement with the age distribution, the most frequent 
response indicated some college education. Although 35 
countries were represented in the respondents, nearly three- 
quarters of respondents were from the United States, with an- 
other 14% from Canada and the United Kingdom. An over- 
whelming majority — 94% of our respondents — indicated En- 
glish was their first language. 

Our respondents were very active players: they reported play- 
ing video games for an average of 23.3 hours per week. Al- 
though this number may appear high to non-gamers, it is 
slightly lower than the 25.9 hours per week reported by MMO 



The survey was open until November 2012, when Bungie, 
Inc. turned off the API. The anonymous design did not permit de- 
tailed interviews. 



The initial gamertag and age screening in the survey did not hint 
at the subsequent parental consent requirement for under 18s; thus, 
misreporting of age for this group may have been an automatic 
response to being queried for their age. However, participants 
could not be prohibited from refreshing the survey, reentering their 
gamertag and then misreporting their age. 
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Leader frequency 

Figure 2. Reported frequency of playing a leader role on a 1-5 scale 
where 1 is never and 5 is always, by the preferred playing style. 

players and the 27.5 hours per week reported by the industry 
in 2007 for video game players in general [41]. 

Psychometrics 

We also asked participants to report their preferred playing 
style: team leader or team support (both collaborative styles) 
or "lone wolf" (a style that does not coordinate actions with 
the team). The survey did not provide definitions of these 
labels to respondents; however, feedback from our expert 
players indicated that their meanings should have been com- 
monly understood within the Reach community. Although 
the popular stereotype of FPS games is non-collaborative, we 
find that 78.6% of players prefer playing as a team, in ei- 
ther the leading or supporting roles. Thus, Reach players are 
in fact strongly motivated by the collaborative aspects of the 
game [37,43]. When asked how often they played as a leader, 
participants reported typically playing their preferred role, al- 
though interestingly the lone wolves report playing a leader 
much more often than we would naively expect, given the 
anti-social nature of the lone wolf style (Fig. 2). 

We find some support for the popular perception that younger 
players prefer the non-collaborative lone wolf role [24] . In 
our sample, people who preferred the team support role were 
indeed older (23.5 years) than those who preferred to play as 
lone wolves (21.6 years; p < 0.001). The modest size of the 
difference, however, suggests that while the effect is real, it is 
only a very weak tendency, with many younger players pre- 
ferring collaborative roles and many older players preferring 
non-collaborative ones. Interestingly, people who preferred 
to play as leaders were also significantly younger (21 years) 
than those who played team support (p sa 0). 

Using our expertise as Reach players, we modified three clas- 
sic psychometric surveys intended to assess group dynamics: 
entativity (how much a group is like a single entity), cohe- 
sion (how tightly-knit a group is), and conflict (how much 
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Proportion of Wins 
Figure 3. Proportion of games won by our respondents compared to 
randomly selected players 

internal conflict a group has). Past work on collaboration 
and team performance suggests that low-levels of conflict 
within a group of friends should correlate with better perfor- 
mance [21,22,35]. 

The entativity portion of the survey contained 14 questions, 
which had reasonable variability (SD=0.976 on a 5-point 
scale) and good internal consistency (Cohen's a — 0.82). We 
therefore averaged the responses to create a single entativity 
metric. The average perceived entativity was 3.93 out of a 5- 
point scale, indicating respondents typically felt their teams 
were good at acting as one. The portion of the survey on 
cohesiveness contained 7 questions, which exhibited similar 
variability (SD=0.972) and internal reliability (a = 0.83), so 
again we averaged the responses to create a single metric. In 
agreement with the perceived entativity, players typically felt 
their teams were very cohesive (4.1 out of 5). Finally, the 
portion of the survey on group conflict consisted of 8 ques- 
tions with slightly lower variability (SD=0.92) and less inter- 
nal reliability (a = 0.76), which appears to be related to the 
presence of reverse-scored questions. Nonetheless, we felt 
the consistency was sufficient and averaged the scores. Here, 
perceived conflict was low (1.77), but somewhat higher than 
might be expected given the high entativity and cohesion rat- 
ings. 

Comparison to Random Players 

Advertising the anonymous survey on Halo-oriented commu- 
nity websites likely produced a biased sample of respondents 
relative to the overall population of Reach players. To charac- 
terize how different our respondent population was from the 
overall player population, we downloaded a uniformly ran- 
dom 1/1000 sample of the first 400,000,000 Reach games 
from which we extracted 963,000 unique gamertags. 3 We 
then downloaded each of their player detail files and esti- 
mated the background distributions of various player sum- 
mary statistics. Within this population, the average propor- 



" The API does not provide a method by which to directly select a 
uniformly random player. 
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Figure 4. Time spent playing Halo:Reach by survey participants versus 
randomly selected players 

tion of wins exhibited by a player is almost exactly 1/2, indi- 
cating a fairly unbiased sample of all players (Fig. 3). 

Compared to the background distribution, our respondents 
were much better players, winning many more of their games 
(Fig. 3) than the typical Reach player. In addition to being 
more successful players, our respondents also typically in- 
vested almost 10 times more actual time playing Reach than 
the typical player (Fig. 4). That is, our respondents were seri- 
ous Halo players, likely devoting a large share of their overall 
time spent playing video games to this particular game. 

Demographics and Game Play 

Although popular FPS stereotypes suggest that demographic 
variables like player sex may correlate with game perfor- 
mance, we find that males and females behave and perform 
similarly. For instance, males and females played roughly the 
same number of games (Female = 1825.2, Male = 2085.0), 
and the same amount of time per game (Female = 592.3, 
Male = 620.69; total seconds played divided by total games 
played). However, women died slightly more often than men 
(Female = 10.69 deaths/game, Male = 8.86 deaths/game, 
£(1189) = 5.91, p < 0.007). 4 Of course, because of the rel- 
atively small number of female players (5.1% or 61 respon- 
dents), there may be other small but real effects that we are 
unable to detect. 

We also observed interesting differences in game play by 
player age. Congruent with the popular stereotypes of FPS 
gamers, the older a respondent was the more kills he had per 
game (/3 = 0.263, £(1189) = 4.03, p « 0), as shown in Fig- 
ure 5. We also see older players were significantly more capa- 
ble in the lone wolf role, a result that runs contrary to the sug- 
gestion that young players prefer the lone wolf role because 
they are more capable in the role — popularly attributed to 
faster reflexes — while older players must coordinate in teams 
in order to effectively compete [24] . 
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Figure 5. Kills per game (top) and betrayals per game (bottom) by age 
group, showing that the oldest group has significantly more kills per 
game than the younger groups while the youngest has more betrayals 
per game than the older groups. 

One surprising age-correlated behavioral difference is in the 
number of betrayals (killing one's own teammate, which re- 
sults in a loss of a point to the team and a longer respawn time 
for the offending player), as shown in Figure 5. Younger play- 
ers (age < 18) showed a disproportionate amount of this type 
of team disloyalty relative to the older players (/3 = 0.0024, 
£(1189) = —4.71, p « 0), with the former group exhibit- 
ing a betrayal rate about 40% higher than the latter groups. 
Because betrayals result in a penalty against the team's over- 
all score, this suggests younger players exhibit significantly 
increased anti-social behavior while older players generally 
exhibit a more pro-social or cooperative orientation. 

Psychometrics and Game Play 

Unlike purely survey-based studies, the access granted 
through the API to the detailed in-game data allows us to pair 
each survey response with rich independent behavioral data 
on the same individuals. Through this pairing, we may quan- 
titatively verify reported variables on in-game behaviors. If 
the survey responses are accurate, they should predict corre- 
sponding patterns within the behavioral data, and allow us to 
subsequently predict survey responses from behavioral data 
alone. For instance, participants who report preferring the 
lone wolf role do indeed have significantly fewer "assists" to 
teammates (in which two or more team members collaborate 
to score a point) than participants who prefer to be team lead- 
ers (p sa 0) or team support (p < 0.005). 

We also find meaningful differences in the reported group dy- 
namic psychometrics. For instance, the more a player sees 



their group as an entity, the more assists they have per game 
(r = 0.13, p s=s 0). Surprisingly, the opposite is true for 
number of kills: the more entitative groups have significantly 
fewer kills per game (r = —0.16, p f=a 0), perhaps because 
these groups travel as a pack and thus share their kills among 
their number. There are no significant differences in the num- 
ber of deaths per game, however, suggesting perhaps that such 
groups are no better at warding off attacks than less entitative 
groups. 

We observed a similar effect for perceptions of 
cohesiveness — more cohesive groups had significantly 
fewer kills per game (7- — —0.1, p < 0.001), and there were 
no differences in the number of deaths per game. We did not, 
however, see the same relationship between cohesiveness 
and assists that we did for entativity and assists, suggesting 
the measures may be tapping different aspects of the players' 
subjective perceptions of their groups. A player going out of 
their way to assist a teammate, and thereby potentially giving 
up opportunities to score their own points, may require a 
greater sense of unity than mere closeness or camaraderie. 

Studies of team dynamics in the management literature [21, 
22,35] suggest that groups with greater levels of internal con- 
flict should exhibit different game play dynamics than groups 
with lower levels. Surprisingly, however, we found no strong 
or notable relationships between responses about group con- 
flict and game play. There are several possible explanations 
for this pattern, but we do not explore them here. 



PREDICTING FRIENDSHIPS 

Before we consider the question of whether ground-truth 
friendship labels can be predicted purely from in-game be- 
havioral data, we briefly discuss a few additional results from 
the survey data regarding friendships. 

Of the 1191 respondents, 597 played at least one game with 
another survey respondent. 5 This overlap allows us to test 
whether respondents' perceptions of their online and offline 
friendships are reciprocal [30], i.e., when one player labels 
another as a friend, that friend also labels the player as a 
friend. For online friendships, the reciprocity is 36.9%; that 
is, when one of two survey respondents indicated the other 
was an online friend, about a third of the time did they agree 
they were friends. The agreement on offline friendships was 
higher (60.9%). While both of these values may seem low, 
both are in fact much higher than the rates observed in other 
online social networks, e.g., Twitter (22%) [17]. For a social 
network where friendship ties represent a significant mutual 
investment of time, it is not clear why we do not see rates ap- 
proaching 100%, although there are a few possible reasons. 
For instance, we notice that there are a disproportionate num- 
ber of respondents who list exactly 29 online friends, and this 
could indicate a number of respondents did not realize the list 
of teammates was scrollable. If this is the case, online and of- 
fline friends who fell "under the fold" may have been missed. 
Another explanation may be variability in the interpretation 
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Mutual games 

Figure 6. Number of mutual games played with players declared as on- 
line and offline friends. Dashed lines represent offline friends, and blue 
lines represent online friends, so the solid blue line is an online friend 
who is not an offline friend. 



of the term "online friend" or actual differences in friendships 
of Reach players. This is a question for future research. 

If players choose to structure their online activities in Halo: 
Reach around opportunities to play with friends, then the 
friendship labels respondents provided to us should be pre- 
dictable from in-game behavioral data alone. Logistically, 
friends often synchronize their play times, at the same time of 
day or day of the week; this leads each player's gamertag to 
appear in the other player's game files in highly non-random 
ways. Within Reach, if friends are to play together, they must 
use the in-game "party" mechanism, in which one player 
"joins" another player to form a larger unit. Once they have 
formed this unit, the players are automatically placed together 
by the matchmaking algorithm into the same new competition 
and almost always on the same team (though splitting a party 
can happen, which leads to interesting in-game behavior; see 
below.) Thus, players who are friends will tend to appear 
together in a sequence of games and, within each sequence, 
they will typically be on the same side of a competition. 6 

We find significant differences in the frequency of playing 
together when a participant labeled a co-player as an online 
friend, offline friend, both, or neither. Pairs of players who 
are both online and offline friends typically played the most 
together, followed by online but not offline friends, then of- 
fline but not online friends and finally strangers (Fig. 6). That 
is, friends of both types play many more games together than 
do strangers. 



3 This overlap is likely attributable to the way we advertised the sur- 
vey and is not representative of the background population. 



By default, Reach will party up all players within a particular game 
at its conclusion, after a 10 second delay. To prevent this, a party 
or player must "back out" and restart the matchmaking algorithm. 
Most players choose to back out most of the time. 
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Frequencies of Series Lengths 
Figure 7. Number of series played for online and offline friends. Left panel shows series lengths for co-players who are not offline friends, right 
panel shows series lengths for offline friends. Dashed lines are online friends, solid lines are co-players who are not online friends. Colored lines show 
distributions for different minimum number of games played in a series. 



Additionally, the party mechanism implies that when friends 
play together, their gamertags should appear together in a se- 
ries of games, and therefore these series should be a good pre- 
dictor of friendship. We define a series to be a sequence of 
consecutive games played together with no more than a one- 
hour gap between consecutive games. (Other gap sizes yield 
similar results to those reported here, indicating the robust- 
ness of this feature.) For each pair of players, we measured 
the number of series greater than length n and the length of 
the longest series. Figure 7 shows that online friends were 
significantly more likely to have played series of games to- 
gether, as were offline friends. The greatest number of such 
series was played by pairs who were both online and offline 
friends. Similarly, the longest series for online and offline 
friend pairs is significantly longer than the longest series for 
non-friend pairs. 

To illustrate this point, consider the average longest series. 
For strangers, the average was only 1 .25 games in a row, indi- 
cating a strong tendency for these groups to dissolve quickly. 
However, for pairs of players in which either labeled the other 
as an online friend, the average was 10.20 and for those la- 
beled as both online and offline friends, it was 13.15 games 
(corresponding to roughly two hours of clock time). That is, 
friends play many more games together in a sequence (on av- 
erage, greater than 8 times more) than do strangers, and the 
structure of the party mechanism in Reach implies that this 
behavior must be actively selected for by players. 

These patterns suggest that certain purely behavioral features 
may be highly accurate predictors of unknown friendship la- 
bels. Recovering this kind of hidden information has been 
done with a range of other electronic data on social inter- 
actions, including mobile phone call records [12, 13], geo- 
graphic coincidences [11] and email [9], but not previously 
interactions within online games. 

Given the number of competitions two players played and 
the number of those that were played together, the length of 
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False positive rate 

Figure 8. ROC curve for predicting whether a player is considered an 
online friend, given the number of mutual games together, the longest 
series of games, and the number of series of different lengths. 



the longest series, and how many series of different length 
the pair played together (here: {2, 4, 6, 8, 10}, although other 
values could also be used), we try to predict whether either 
player indicated the other was an online friend (for those 
cases where multiple teammates responded to the survey). 
We fit the data to a logistic regression model, and obtain an 
accuracy of 98.7%, representing an AUC of 0.96. The corre- 
sponding ROC curve can be seen in Figure 8, and the ranking 
of the features can be seen in Table 1 . 

The dominant predictive feature in this model is clearly the 
length of the longest series the two players played together. 
The number of mutual games is also a strong positive fea- 
ture, though it's relevance is tempered by the total number 
of games played. The high accuracy of the model indicates 
friendships in Reach can be reliably inferred from online be- 
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Friends on Team 

Figure 9. Average kill-death ratio, for playing with 0-3 friends. Baseline 
is the population average kill-death ratio (1), so even with no friends on 
the team participants are 65% better than average. 

havior using simple heuristics. Given the fairly general nature 
of the predictive features, we expect these results will gen- 
eralize to other online environments where players have the 
option of partying, or otherwise preferentially playing with 
friends. As a caveat, our labeled friendships are biased to- 
ward players with long game histories, and the most predic- 
tive features leverage this deep behavioral data. To what de- 
gree friendships can be inferred using only short behavioral 
histories is an important open question. 

FRIENDSHIP AND PERFORMANCE 

Because we have all competition data for each player and the 
participant-provided friendship labels of who they were play- 
ing with, we can investigate whether a player performs quan- 
titatively differently when they play with or without friends. 
For the task of predicting winners based on the number of 
friends on one's (or one's opponent's) team, we take two sim- 
plifying steps. First, we focus on 4-on-4 games, which are the 
majority of Reach competitions; this controls for the number 
of players on each team and eliminates the need to differ- 
entiate between number of friends on a team or proportion of 
friends on a team. Second, we aggregate a player's online and 
offline friends; if a player labeled another as either an online 
or an offline friend, we labeled that person as a "friend". 

Individual Performance 
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Figure 10. Assist and betrayal frequency, for playing with 0-3 friends 



Here we ask whether the presence of friends on one's own or 
on one's opponent's team affects that player's performance. 
In other words, do people play or perform differently when 
they have friends on their team? If friendships impact the 
success of collaboration, then we expect to see performance 
improve as a player collaborates with more friends. 

A strikingly clear result is that the amount of cooperation and 
defection (measured as assists and betrayals) a player exhibits 
depends strongly on the number of friends on their team. 
Specifically, the more friends on one's team, the more one as- 
sists (pro-social) and the less one betrays (anti-social) one's 
teammates (Fig. 10). The implication is that players actively 
adjust their play based on their friendships — the motivation 
to maintain these relationships is greater than the motivation 
to maintain harmony within the current team or to win the 
current competition. 

Playing with friends also impacts the player's success in the 
game, i.e., their performance. Winning and losing a competi- 
tion is a team outcome (see below), but a measure of individ- 
ual performance is the ratio of a given player's kill to deaths, 
i.e., the number of points they personally scored versus the 
number of points they personally provided to the opposing 
team. A higher individual value indicates better individual 
performance; by definition, the population average kill-death 
ratio must be 1 .0, as there is one death for every kill. Figure 9 
shows that even when the survey respondents are playing with 
no friends on their team, they do better than the average by 
65%, indicating that our survey population is highly skilled. 
However, this already strong performance further increases 
with each additional friend on the team, up to an additional 
20% when the entire team is a group of friends. Thus, at least 
for our respondents, having friends on one's team has a real 
and direct positive effect on one's own performance, indepen- 
dent of whether the team goes on to win the competition. 

Team Performance 



Estimate Std. Error z value Pr(>|z|) 
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Table 1. Parameters in a logistic regression predicting whether a player 
is labeled an online friend 



Table 2. Best-fitting parameters for logistic regression model predicting 
whether a player won a game based only on the number of online or 
offline friends on their own team and on their opponent's team. 
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Friends on Opponent's Team 
Figure 11. The frequency of betrayals on opponent's team when the 
team consists 0-4 of the player's friends 

Given the effect of friends playing together on the individual's 
frequency of assists and betrayals, it is not too surprising that 
we see a similar pattern across the entire team. That is, inde- 
pendent of the player's own assists and betrayals, the team's 
assists increase, and its betrayals decrease, with the number 
of friends playing together. 

As mentioned above, the matchmaking algorithm sometimes 
places members of a party on the opposing team, which al- 
lows us to investigate what happens when a player must 
choose between helping their teammates versus helping their 
friends (on the opposing team). In this case, we find an inter- 
esting pattern: as the number of friends on the opposing team 
increases, so too does the number of betrayals on that team. 
This trend continues until there is a majority of friends on the 
other team, at which point the number of betrayals decreases 
again (Fig. 1 1). In other words, if one or two friends find 
themselves playing against their friends, they are much more 
likely to kill their (non-friend) teammates than if they had no 
friends on the opposing team. The flip in the trend indicates 
that when a party's loyalties are divided across two teams, the 
smaller subset of friends is the one that defects against their 
teammates. 

Similar to prior experimental work [2], and in agreement 
with our results on the kill-death ratio above, we find that 
players win more often when playing with friends than with 
strangers, and the more friends they play with the better 
(Fig. 12). Notably, this effect appears despite the match- 
making algorithm's effort to minimize differences in team 
skill [18] and thereby decrease the predictive power of non- 
game features like friendship. That is, the TrueSkill match- 
making algorithm used by Reach attempts to control for the 
effect of skill in improving performance, but it does not ac- 
count for the performance benefits of from friendship. 

We fit a logisitic regression model using only information 
about how many online and offline friends were on a player's 
own team and on the opposing team. The corresponding pa- 
rameters are shown in Table 2. The model achieves an accu- 
racy of 6 1 . 1 %, with an AUC of 0.573. We note that predicting 




Friends on team 
Figure 12. Proportion of wins when playing with 0-3 friends 

the outcome of competitions using only the number of friends 
on a team is a very weak signal — being friends cannot make 
bad players good — so it is in fact highly meaningful that there 
is any improvement in accuracy over random chance (AUC of 
0.5). 

We also tried to predict a competition's winner based on a 
rough metric of the players' expertise on each team. Because 
we do not have the entire game history of all players, we are 
unable to recreate the TrueSkill [18] estimates of the players' 
skill level. Moreover, despite the matchmaking algorithm's 
efforts to equalize skill between teams, there is still consid- 
erable variability in skill levels across competitions. Instead 
we use reasonable proxies for a player's skill in Reach, in- 
cluding the average number of games played by members 
of each team, the average progress in the single-player cam- 
paign and co-operative campaign, and the campaign difficulty 
level. This model performs slightly better than the friends- 
only model, improving the accuracy to 62.2% and the AUC 
to 0.614. As can be seen in Table 3, more games played on 
one's own team is only slightly more important than fewer 
games played by one's opponents. 

Combining the expertise and friendship information provides 
a marginal improvement over expertise alone, although both 
the level of expertise and the number of friends on a team 
contribute to the model's prediction of the outcome. The three 
most heavily weighted parameters are the expertise on one's 
own team, the opposing team's expertise, and the number of 
online friends on one's own team (Table 4). The model has a 
negligible improvement in accuracy, to 62.4%, but the AUC 
improves slightly to 0.625. 

CONCLUSION 

In this paper we investigated the relationship of player demo- 
graphic variables, such as age, sex and group cohesion, with 
player behavior, and the impact of friendships on individual 
and team performance in complex competitive environments. 
Toward this end, we used the online game Halo: Reach, one 
of the most popular games of 2010 [14] and one of the few in 
the FPS genre that has been studied in this way, as a model 
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Table 3. Parameters for the logisitic regression model predicting wins 
based on the average number of games played by one's own teammates 
and one's opponents. 



system. Our results only scratch the surface of this large and 
multi-faceted online system to inform our understanding of 
collaboration and competition in virtual environments. 

Our anonymous online survey produced a number of insights 
into the Reach community, many of which likely generalize 
to the FPS genre as a whole. First, our respondents both in- 
vested more time in the game — but not more time than a typ- 
ical video game player spends on games of all types — and 
were significantly more skilled than the typical Reach player. 
Contradicting the anti-social stereotype of FPS players, our 
respondents were strongly motivated by social factors. More 
than 99.5% of our respondents indicated that they played with 
at least one online friend, and the typical number was 21 on- 
line friends (and 4 offline friends). 7 Nearly 80% preferred to 
play socially, on a team, rather than individually as a "lone 
wolf," and most players felt a strong sense of group cohe- 
siveness and coordination. Thus, friendships among Reach 
players play an important role in choices of play style, game 
engagement and investment. 

Although the typical age of our respondents was close to 20 
years old, the distribution shows a long tail, with a large num- 
ber of players over 30. Age correlated with several interesting 
aspects of game play: older players (24 or older; 30%) tend to 
exhibit somewhat less within-group conflict, exhibit greater 
pro-social tendencies (e.g., fewer betrayals), and are slightly 
more skilled (more kills per game). This latter point is coun- 
terintuitive given that younger players often have greater free 
time in which to invest in the game. We found only small dif- 
ferences in play statistics between male and female respon- 
dents and predictive models trained to predict sex from these 
values scored no better than chance. Thus, male and female 
players seem nearly indistinguishable from their in-game be- 
havior. 

Both team and individual performance in Halo: Reach are 
improved by friendship variables and teams composed of 
friends win more games on average than teams composed of 
strangers. However, if overall skill correlates across friend- 
ship ties, then highly skilled groups of friends could tend to 
win more often than groups of strangers because the skilled 
friends are more skilled than an average stranger. That is, skill 
could be homophilous. However, the TrueSkill matchmaking 
algorithm, which actively attempts to eliminate skill differ- 
ences between teams, likely mitigates the impact of this con- 
founding factor. TrueSkill does not control for the benefits 
that comes from playing with friends — benefits we observe 
quite clearly (Fig. 12). Admittedly, our study design does not 
allow us to identify the social mechanism by which friend- 
ships improve performance. A followup study that includes 
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7 But, the degree distribution does not follow a power law [10]. 



Table 4. Parameters for logistic regression predicting winners based on 
expertise and number of friends on one's own and one's opponent's team 



additional survey questions designed to distinguish among 
candidate mechanisms, and perhaps in depth interviews, may 
illuminate the answer. 

That being said, our results do suggest some interesting clues. 
For instance, we observed more assists and fewer betrayals 
of teammates when there were more friends on the team, 
in agreement with research showing friendships decrease 
within-group conflict [21]. Alternatively, the large amount 
of time friends spend playing together may improve the divi- 
sion of specialized roles and improve communication within 
the team, leading to a better coordinated effort [22, 33]. 

Overall, friendships not only improve performance, but they 
also reshape the style of play within competitions. That is, 
players compete differently when they play with friends than 
when they play with strangers. For instance, groups of friends 
who are split across teams have to choose whether to com- 
pete against their friends (cooperate with their teammates) or 
defect against the teammates (cooperate with their friends). 
When this happens, friends tend to defect against their team- 
mates, illustrated by a nearly double betrayal rate when two 
friends are on the opposing team. That is, friendship ties dom- 
inate teammate ties, and players sacrifice their own compet- 
itive success to help their friends. Similarly, betrayal rates 
(both by individuals and by teams) decrease as the number 
of friends on the team increase, illustrating a significant pro- 
social effect as compared to teams of strangers. 

Although our logistic regression models found that friendship 
variables make marginal improvements over skill variables 
in our ability to predict success, this is understandable. In a 
context in which skill matters, a team of unskilled friends is 
sure to do more poorly than a team of skilled strangers. The 
important point is that even among skilled players, friendship 
ties provide a lift in both individual and team performance. 

To conclude, we found a number of interesting relationships 
between in-game behavior within Halo.Reach and player de- 
mographic, group covariates and friendship ties. Most gen- 
erally, we found that friendships strongly influence the social 
dynamics and collaboration within complex virtual environ- 
ments like the FPS game Halo: Reach. Friendships lead to 
improved individual and team performance, increased pro- 
social behavior, and likely increased long-term appeal of and 
engagement in the game. 

The impact of friendships is sufficiently strong, with players 
actively structuring their in-game activities around opportu- 
nities to play with friends, that hidden or unknown friendship 
ties can in fact be correctly inferred directly from behavioral 
time series using common sense heuristics. That is, it should 



be possible to automatically identify pairs of individuals who 
would call each other friends, if asked, merely by watching 
the way they interact with each other online. Collaborative 
systems, including online FPS games like Reach but also a 
wide variety of other systems, that can infer and account for 
such friendship ties could thus enable a wide range of better 
outcomes by providing fewer mismatches between individu- 
als or by suggesting individuals to interact with in the future. 
Given the increasing ubiquity of social gaming and ad hoc 
competition in general, the importance of such algorithms is 
likely significant. 
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